CDS
Accession Number | TCMCG075C11784 |
gbkey | CDS |
Protein Id | XP_007040945.2 |
Location | join(36334834..36334961,36336148..36336451,36336623..36336691,36336775..36336912,36337019..36337136,36337667..36337749,36337874..36337969,36338574..36338742,36338825..36338967,36339048..36339134,36339238..36339423,36339913..36340080,36341779..36341899,36342019..36342088,36342479..36342566,36342671..36342791,36343245..36343438,36343531..36343639,36343847..36344002,36344366..36344468,36344859..36345032,36345559..36345704,36346013..36346206) |
Gene | LOC18606966 |
GeneID | 18606966 |
Organism | Theobroma cacao |
Protein
Length | 1054aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_007040883.2 |
Definition | PREDICTED: alpha-glucosidase 2 isoform X1 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | G |
Description | Belongs to the glycosyl hydrolase 31 family |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction |
R00028
[VIEW IN KEGG] R00801 [VIEW IN KEGG] R00802 [VIEW IN KEGG] R06087 [VIEW IN KEGG] R06088 [VIEW IN KEGG] |
KEGG_rclass |
RC00028
[VIEW IN KEGG] RC00049 [VIEW IN KEGG] RC00077 [VIEW IN KEGG] |
BRITE |
ko00000
[VIEW IN KEGG] ko00001 [VIEW IN KEGG] ko01000 [VIEW IN KEGG] |
KEGG_ko |
ko:K01187
[VIEW IN KEGG] |
EC |
3.2.1.20
[VIEW IN KEGG]
[VIEW IN INGREDIENT] |
KEGG_Pathway |
ko00052
[VIEW IN KEGG] ko00500 [VIEW IN KEGG] ko01100 [VIEW IN KEGG] map00052 [VIEW IN KEGG] map00500 [VIEW IN KEGG] map01100 [VIEW IN KEGG] |
GOs | - |
Sequence
CDS: ATGGATATATCGGTAGACCTGAGCAGCAGAATCAGGAGCGGAGGGGCCTTGGAATTATGTTATTGTCACTACTTATGTAATCGTCCTCGTTATTCTCCACCACCATCAAGAAGAGAAGTTCTCTACAGAGGCTGCCATGCAGAAAGGAGTTACAATAAGCTACTTAGTGGGAGAAAGCTAATGTCTAAAATGGCTAGTTCTGAAGTGAAGGAAGCAGCATCAGACTCCACTGCTGGGAAAATGATATTTGAGCCCATCTTGGAGGATGGAGTTTTCCGGTTTGATTGCTCTGCAAACGATAGAGATGCAGCATATCCAAGTCTTTCATTTATGAATAGCAACGACAGGGATGTACCAATCATGAGTAACAAGGTTCCTTTGTATATCCCTTCTTTTGAGTTCCTTTTGGGACAGCAGCTTGTCAAACTTGAGCTTCCTGTTGGTACCTCATTTTATGGAACTGGAGAAGTTAGCGGGCAGCTAGAGCGAACAGGAAAAAAAGTTTTTACATGGAACACAGATGCATGGGGTTATGGTCCTGGAACTACATCCTTGTACCAATCACATCCTTGGGTGCTGGCTGTTCTTCCAAATGGGGAGGCATTGGGAATTCTAGCTGACACGACAAGGCGCTGTGAGATTGATCTAAGGATCAAATGCAGAATTCAGTTTAATGCTCCAGCCTCATTTCCTGTTATTACATTTGGTCCATTTCCTTCACCCAGTGCAGTTTTGACATCTTTATCTCATGCAATTGGAACTGTTTTTATGCCTCCAAAATGGTCTTTAGGCTATCACCAATGCCGTTGGAGCTATGACTCTGAGGAGAGAGTTCTTGAGGTTGCAAGAAAGTTTCGGGAGAAGGGTATACCTTGTGACGTGATATGGATGGATATCGATTACATGGATGGCTTTCGTTGTTTCACTTTTGACAAGGAGCGTTTCCCAGATCCAAAGTCTTTGGTGAAAGATCTCCACCATATTGGTTTCAAGGCAATATGGATGCTTGACCCAGGGATTAAACATGAAAAGGGTTACTTTGTTTATGATAGTGGGACTGAACATGATGCTTGGATACAGGAAGCAAATGGGATGTATTTTGTTGGGGATGTGTGGCCAGGGCCTTGTGTTTTTCCTGATTTTACACAGTCAAAAATTCGTTCTTGGTGGGCGAATTTAGTTAGAGATTTCATTTCTAATGGTGTCGATGGTATATGGAATGATATGAATGAGCCAGCTATTTTTAAGGCTGTAACAAAAACAATGCCTGAGAGCAACATTCACCGGGGTGATAATGAACTTGGTGGTCATCAAAGTCATGCACACTATCACAATGCATATGGGATGCTAATGGCAAGATCAACTTATGAAGGGATGGAACTAGCTGATAAAAGAAAGCGCCCCTTTGTTCTAACCAGAGCTGGATTTATTGGGAGTCAAAGGTATGCTGCAATGTGGACAGGAGATAACCTTTCCAATTGGGAGCACCTTCACATGTCCATCTCCATGGTACTTCAATTGGGCCTAAGTGGTCAGCCCCTATCAGGACCTGATATTGGTGGGTTTGCTGGAAATGCAACACCCAAACTTTTTGGACGTTGGATGGGCTTTGGTGCTATGTTTCCATTCTGTCGTGGGCACTCAGAGACTGACACCATTAACCATGAACCATGGTCATTTGGTGAAGAGTGTGAAGATGTGTGTCGCCTTGCATTGAGAAGGCGTTATCGCCTCATACCACACATATACACTCTTTTTTATATGGCACATACTAGAGGTACTCCTGTGGCAACTCCTGCATTTTTTGCTGATCCTAAAGATCCCAATTTGAGGACGCTTGAGAGTTGCTTCCTTTTGGGCCCACTTTTAGTCTATGCAAGCACGATGCCAGATCTGGGGTCTGATAAATTGCAGCTCTTGTTGCCTAAAGGAATTTGGTTGAGCTTTGATTTTGACGATTCGCATCCGGATTTACCAGCTTTATATTTGCAAGGTGGATCTATTATACCTGTGGGTCCTCCTCTTCAGCATATTGGTGAATCTAATCCATCAGATGATTTAACACTTATCCTAGCTTTAGACAATTATGGGAAAGCTGAAGGTGTTCTCTTTGAAGATGATGGTGATGGATATGGATTTACTAAAGGTGAATATCTTTTAACTCACTATGTTGCGGAGCTTAAATCTTCAGTGATCACTGTCAGAATATCTGAGACTAAAGGAGTATGGAAGAGGCCAAACCGTCGTCTACATGTTCAACTCTTGATTGGTGAAGGTGCAATGCTTGATGCGTGGGGCATAGATGGAGAGGTTCTACAAATAGAAATGCCTTCTGAAACTGAGGTGTCTAAGCTGATATCTACTAGAAAAGTGCATAATAAAATGCATCTAGAAAGCGTTAAGCTTATTCCAAATGTGGAAGATGTTTCTGGACACAAGGGAGGAGAACTTTCAAGGACTCCGATTGAGCTGGAAAATGGTGATTGGTCACTGCAAATAGTACCCTGGATTGGGGGTAGAATTATTTCCATGGTTCATGTTCCCTCAGGAAGACAATGGCTTCATAGCAGGGTTGAAATTAATGGATATGAAGAGTATGGTGGTACTGAGTACCGGTCTGCTGGATGTTCTGAGGAATACCATGTCATCCAGAGAGATGTTGAGCATGCAGTAGAGGAGGAATCAGTTCTGTTAGAAGGTGATATTGGTGGTGGCTTAATCCTTCAGCGGCAGATAACCATACCCAAAGATAATCCAAAGGTTTTCCGGGTTGAATCTAGTATTTTGGCTCGCAAAGTTGGTTCTGGTTCTGGTGGATTTTCAAGGCTGGTTTGCTTGAGAGTTCATCCAACTTTCTCTCTCTTGCACCCAACAGAAAGTTTTGTAGCATTTACCTCCATTGATGGCTCTAAGCAGGAAGTTTGGCCAGAATCTGGAGAACAACTTTATGAAGGGAATCTTCTGCCTAATGGTGAATGGATGCTCGTTGATAAATGTCTTGGTCTGGGGCTTATCAATCGATTTAATGTTCGTGATGTCTATAAGTGCCTCATCCACTGGGGAACAGGGACAGTAAACTTGGAGCTGTGGTCTGAAGACCGTCCTGTTTCTAAACAATCACCTCTTAGGGTATTTCACGAGTATGAGGTGATGGAAATCCCATAA |
Protein: MDISVDLSSRIRSGGALELCYCHYLCNRPRYSPPPSRREVLYRGCHAERSYNKLLSGRKLMSKMASSEVKEAASDSTAGKMIFEPILEDGVFRFDCSANDRDAAYPSLSFMNSNDRDVPIMSNKVPLYIPSFEFLLGQQLVKLELPVGTSFYGTGEVSGQLERTGKKVFTWNTDAWGYGPGTTSLYQSHPWVLAVLPNGEALGILADTTRRCEIDLRIKCRIQFNAPASFPVITFGPFPSPSAVLTSLSHAIGTVFMPPKWSLGYHQCRWSYDSEERVLEVARKFREKGIPCDVIWMDIDYMDGFRCFTFDKERFPDPKSLVKDLHHIGFKAIWMLDPGIKHEKGYFVYDSGTEHDAWIQEANGMYFVGDVWPGPCVFPDFTQSKIRSWWANLVRDFISNGVDGIWNDMNEPAIFKAVTKTMPESNIHRGDNELGGHQSHAHYHNAYGMLMARSTYEGMELADKRKRPFVLTRAGFIGSQRYAAMWTGDNLSNWEHLHMSISMVLQLGLSGQPLSGPDIGGFAGNATPKLFGRWMGFGAMFPFCRGHSETDTINHEPWSFGEECEDVCRLALRRRYRLIPHIYTLFYMAHTRGTPVATPAFFADPKDPNLRTLESCFLLGPLLVYASTMPDLGSDKLQLLLPKGIWLSFDFDDSHPDLPALYLQGGSIIPVGPPLQHIGESNPSDDLTLILALDNYGKAEGVLFEDDGDGYGFTKGEYLLTHYVAELKSSVITVRISETKGVWKRPNRRLHVQLLIGEGAMLDAWGIDGEVLQIEMPSETEVSKLISTRKVHNKMHLESVKLIPNVEDVSGHKGGELSRTPIELENGDWSLQIVPWIGGRIISMVHVPSGRQWLHSRVEINGYEEYGGTEYRSAGCSEEYHVIQRDVEHAVEEESVLLEGDIGGGLILQRQITIPKDNPKVFRVESSILARKVGSGSGGFSRLVCLRVHPTFSLLHPTESFVAFTSIDGSKQEVWPESGEQLYEGNLLPNGEWMLVDKCLGLGLINRFNVRDVYKCLIHWGTGTVNLELWSEDRPVSKQSPLRVFHEYEVMEIP |